ATTORNEY DOCKET NO.: GENE-106/02US 

Appl. No.: 10/633,306 
Page 2 



Amendments to the Claims: 

This listing of claims will replace all prior versions and listings of claims in the 
application. 

Listing of the Claims: 

1. (Currently Amended) A method of identifying one or more marker genes whose level of 
expression predicts a biological response in a cell or tissue, comprising: 

(a) obtaining gene expression data by at least one hybridization assay for at least a test 
cell or tissue group and a control cell or tissue group, each of the test and control groups 
comprising a collection of cell or tissue samples; 

(b) analyzing the gene expression data to identify said one or more marker genes by a 
linear discriminate metric that does not include a variable corresponding to a factor selected from 
the group consisting of: the magnitude of the difference in gene expression for each gene 
between the test and control groups, and the behavior or identity of other genes detected in the 
samples or across samples and combinations thereof, thereby identifying one or more marker 
genes whose level of expression predicts a biological response in a cell or tissue. 

2. (Currently Amended) [[A]] The method of claim 1, wherein the test cell or tissue group 
is exposed to a toxin. 

3. (Currently Amended) [[A]] The method of claim 1, wherein step (b) does not rely on a 
correlation matrix to account for relationships or interdependencies between the genes. 

4. (Currently Amended) [[A]] The method of claim 2, wherein step (b) comprises 
computing probability distributions from the gene expression data for each gene found in the 
toxin exposed cell or tissue group and the control cell or tissue group. 

5. (Currently Amended) [[A]] The method of claim 2, wherein the toxic response is 
hepatotoxicity. 

6. (Currently Amended) [[A]] The method of claim 2, wherein the toxin and control groups 
comprise at least about 10 samples. 

7. (Currently Amended) [[A]] The method of claim 2, wherein the toxin and control groups 
comprise at least about 50 samples. 

8. (Currently Amended) [[A]] The method of claim 2, wherein each gene is given equal 
weight as a predictive marker. 

9. (Currently Amended) [[A]] The method of claim 1, wherein step (b) comprises 
determining a scoring function and a discriminate score for each gene. 

10. (Currently Amended) [[A]] The method of claim 9, wherein the scoring function f1(Z) is 
generated according to the formula: 

wherein Z is a cell or tissue sample; 

f1(Z) is the discriminate score for sample Z relative to the samples of group 1; 

m1 is an estimator of the true mean for the distribution of gene expression levels for the samples 
of group 1; 

m2 is an estimator of the true mean for the distribution of gene expression levels for the samples 
of group 2; 

v1 is an estimator of the variance in the distribution of gene expression levels for the samples of 
group 1; and 

v2 is an estimator of the variance in the distribution of gene expression levels for the samples of 
group 2, wherein group 1 refers to the test group and group 2 refers to the control group. 

11. (Currently Amended) [[A]] The method of claim 10, wherein a discriminate score for 
each gene is calculated by summing the instances where the scoring function returns a value for 
each sample that is greater than a predetermined significance level. 

12. (Currently Amended) [[A]] The method of claim 11, wherein the discriminate score (P) 
for each gene is calculated between the test and control groups by: 

(a) counting the number of times, P1, that f1(Xi) > ½ for i = 1..t and the number of times, P2, that 
f1(Yj) < ½ for j = 1..n; and 

(b) calculating the P discriminate score according to the formula: (P1 + P2)/(n + t). 

13. (Currently Amended) [[A]] The method of claim 1, wherein the one or more marker 
genes discriminate between the test and control groups. 

14. (Currently Amended) [[A]] The method of claim 1, wherein the gene expression data 
comprises measurement of the amount or relative amount of mRNA for each gene. 

15. (Currently Amended) [[A]] The method of claim 1, wherein the one or more markers are 
cross-validated. 

16. (Currently Amended) [[A]] The method of claim 1, wherein the markers stratify a patient 
population. 

17. (Currently Amended) [[A]] The method of claim 16, wherein the patient population is 
stratified for its response to the administration of a drug or according to a physiological 
characteristic relevant to a clinical trial. 

18. (Withdrawn) A method of building a classification database, comprising: 

(a) obtaining gene expression data for at least one test group of cell or tissue samples and 
at least one control group of cell or tissue samples; 

(b) identifying marker genes by an LDA metric capable of discriminating between the 
test and control groups and storing these markers in the database; 

(c) evaluating the predictive ability of at least one of the markers of step (b); and 

(d) refining the database with information derived from additional biological studies. 

19. (Withdrawn) A method of claim 18, wherein the database comprises gene expression 
information and meta-data. 

20. (New) The method of claim 1, wherein step (a) comprises hybridizing at least one probe 
to the test cell or tissue group. 

21. (New) The method of claim 1, wherein step (a) comprises hybridizing at least ten probes 
to the test cell or tissue group. 

22. (New) The method of claim 1, wherein step (a) comprises hybridizing at least one 
hundred probes to the test cell or tissue group. 
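
By way of illustration only, the per-gene scoring and counting recited in claims 9 to 12 can be sketched in Python. The sketch rests on several assumptions that are not stated in this listing. The scoring function f1 is taken to be a ratio of normal densities built from m1, v1, m2 and v2; the formula of claim 10 is not reproduced above, so this form is an assumption. The estimators are taken to be the sample mean and sample variance, the threshold is taken to be 1/2, and the denominator of the discriminate score is taken to be the total number of samples, n + t. All function names are hypothetical.

```python
import math
from statistics import mean, variance
from typing import Callable, Sequence


def gaussian_density(z: float, m: float, v: float) -> float:
    """Normal density with mean m and variance v, evaluated at z."""
    return math.exp(-0.5 * (z - m) ** 2 / v) / math.sqrt(2.0 * math.pi * v)


def make_scoring_function(
    m1: float, v1: float, m2: float, v2: float
) -> Callable[[float], float]:
    """Build f1(Z). ASSUMPTION: f1 is the group-1 density divided by the
    sum of both group densities; the claimed formula is not shown in the source."""
    def f1(z: float) -> float:
        d1 = gaussian_density(z, m1, v1)
        d2 = gaussian_density(z, m2, v2)
        return d1 / (d1 + d2)
    return f1


def discriminate_score(
    test_levels: Sequence[float],     # X_1..X_t, group 1 (test)
    control_levels: Sequence[float],  # Y_1..Y_n, group 2 (control)
    threshold: float = 0.5,           # ASSUMPTION: threshold of 1/2
) -> float:
    """Score one gene as the fraction of samples placed on the expected side."""
    # ASSUMPTION: sample mean and sample variance serve as the estimators.
    m1, v1 = mean(test_levels), variance(test_levels)
    m2, v2 = mean(control_levels), variance(control_levels)
    f1 = make_scoring_function(m1, v1, m2, v2)

    # P1: test samples scored above the threshold.
    p1 = sum(1 for x in test_levels if f1(x) > threshold)
    # P2: control samples scored below the threshold.
    p2 = sum(1 for y in control_levels if f1(y) < threshold)

    # ASSUMPTION: the denominator is the total sample count, n + t.
    return (p1 + p2) / (len(control_levels) + len(test_levels))
```

For a gene measured at 5, 6 and 7 in the test group and at 1, 2 and 3 in the control group, `discriminate_score([5.0, 6.0, 7.0], [1.0, 2.0, 3.0])` returns 1.0, because every sample falls on the expected side of the threshold. Each group needs at least two samples and a nonzero variance for the estimators in this sketch to be defined.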